Abstract

In this paper, we study dependence coefficients for copula-based Markov chains. We provide new 
tools to check the convergence rates of copula-based Markov chains. We also provide a necessary 
condition for Markov chains from the Metropolis-Hastings algorithm to be exponential $\rho$-mixing. A 
general necessary condition on symmetric copulas to generate exponential $\rho$-mixing is given. At the 
end of the paper, we comment on and improve some of our previous results on mixtures of copulas. 

Key words: Markov chains, copula, mixing conditions, reversible processes, ergodicity, 
Metropolis-Hastings. 

AMS 2000 Subject Classification: Primary 60J20, 60J35, 37A30. 

1 Introduction 

This work is motivated by a paper by Xiaohong Chen, Wei Biao Wu and Yanping Yi (2009) and 
a result of B.K. Beare (2010). In the paper [15], Xiaohong Chen et al. have shown that Markov 
processes generated by the Clayton, Gumbel or Student copulas are geometrically ergodic. They 
used quantile transformations and small sets to show geometric ergodicity, but could 
not handle, for instance, mixtures of these copulas. In the paper [7], M. Longla et al. have 
shown that these examples are actually exponential $\rho$-mixing. We have also answered the open 
question on geometric ergodicity of convex combinations of geometrically ergodic reversible Markov 
chains. In a recent paper, M. Longla (2012) obtained sufficient conditions for an exponential $\rho$-mixing 
rate, improving a previous result of Beare (2010). 

Quantifying the dependence among two or more random variables has been an enduring task for 
statisticians. Copulas are full measures of dependence among components of random vectors. Unlike 
marginal and joint distributions, which are directly observable, a copula is a hidden dependence 
structure that couples a joint distribution with its marginals. An early statistical application of 
copulas was given by Clayton (1978) [3], where the dependence between two survival times in a 
multiple events study is modeled by the so-called Clayton copula 

$$C(x,y) = (x^{-a} + y^{-a} - 1)^{-1/a}, \qquad a > 0.$$

The literature on copulas is growing fast; an excellent overview and guide to the literature and 
applications is given by Embrechts et al. (2003) [5]. In later research into copulas, a driving force has 
been financial risk management, where copulas model dependence among different assets in a portfolio. 
Nelsen's monograph (2006) [10] can be regarded as one of the best books for an introduction to 
copulas. We also define a new list of copulas with functional parameters that can be used in various 
models. 

1.1 Definitions 

1.1.1 2-Copulas 

A 2-copula is a function $C : [0,1] \times [0,1] \to [0,1] = I$ for which $C(0,y) = C(x,0) = 0$ ($C$ is 
grounded), $C(1,x) = C(x,1) = x$ (each coordinate is uniform on $I$) and, for all 
$[x_1, x_2] \times [y_1, y_2] \subset I^2$, 
$C(x_1,y_1) + C(x_2,y_2) - C(x_1,y_2) - C(x_2,y_1) \ge 0$. The definition doesn't mention probability, but 
in fact these conditions imply that $C$ is the joint cumulative distribution function of two random 
variables with margins uniform on $I$. It follows from the definition (see [10, chapter 1]) that the 
function $C$ is non-decreasing in each of its arguments and has partial derivatives a.e. with values 
between 0 and 1. Each partial derivative of $C$ is non-decreasing in the other argument. A convex 
combination of 2-copulas is a 2-copula. 
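
The three defining conditions can be checked numerically on a grid. The following is a minimal sketch, assuming the Clayton copula of the Introduction as the test case; the function names, the grid size and the tolerances are illustrative choices and not part of the original text. A grid check of this kind can expose a violation but does not prove that a function is a copula.

```python
def clayton(x, y, a):
    """Clayton copula C(x, y) = (x^-a + y^-a - 1)^(-1/a) for a > 0."""
    if x == 0.0 or y == 0.0:
        return 0.0  # grounded: limit of the formula as x or y tends to 0
    return (x ** (-a) + y ** (-a) - 1.0) ** (-1.0 / a)


def rectangle_volume(c, x1, x2, y1, y2):
    """C-volume of [x1, x2] x [y1, y2]; it must be >= 0 for a 2-copula."""
    return c(x1, y1) + c(x2, y2) - c(x1, y2) - c(x2, y1)


def check_copula_on_grid(c, m=50, tol=1e-12):
    """Check groundedness, uniform margins and 2-increasingness on a grid."""
    grid = [i / m for i in range(m + 1)]
    grounded = all(c(0.0, t) == 0.0 and c(t, 0.0) == 0.0 for t in grid)
    uniform = all(
        abs(c(1.0, t) - t) < 1e-9 and abs(c(t, 1.0) - t) < 1e-9 for t in grid
    )
    increasing = all(
        rectangle_volume(c, grid[i], grid[i + 1], grid[j], grid[j + 1]) >= -tol
        for i in range(m)
        for j in range(m)
    )
    return grounded, uniform, increasing


if __name__ == "__main__":
    # Hypothetical parameter value a = 2, chosen only for illustration.
    print(check_copula_on_grid(lambda x, y: clayton(x, y, 2.0)))
```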

If $X_1, X_2$ are random variables with joint distribution $F$ and marginal distributions $F_1, F_2$, 
then the function $C(x,y)$ defined by $C(F_1(x_1), F_2(x_2)) = F(x_1,x_2)$ is a copula. Moreover, if the 
random variables are continuous, then the copula is uniquely defined by the joint distribution and 
the marginal distributions by the formula $F(F_1^{-1}(x_1), F_2^{-1}(x_2)) = C(x_1,x_2)$. This fact is known as 
Sklar's theorem. The implication of Sklar's theorem is that, after standardizing the effects of 
marginals, the dependence among components of $(X_1, X_2)$ is fully described by the copula. Indeed, most 
conventional measures of dependence can be explicitly expressed in terms of the copula. 

1.1.2 Copulas and Markov processes 

Copulas have been shown to be a more flexible way to define a Markov process, as in the case when 
one suspects that the marginal distributions of the states are not related to the distribution of the 
initial state. Using copulas will allow changes in single marginal distributions, without having to 
change all other distributions in the chain. 

A stationary Markov chain $(X_n, n \in \mathbb{Z})$ can be defined by a copula $C(x,y)$ and a 
one-dimensional marginal distribution. For stationary Markov chains with uniform marginals on $[0,1]$, the 
transition probabilities for all $n \in \mathbb{Z}$ are $P(X_n \in A \mid X_{n-1} = x) = C_{,1}(x,y)$ for sets 
$A = (-\infty, y]$ (for more details, see [4, theorem 3.1]). Here, $C_{,1}(x,y)$ is the derivative of 
$C(x,y)$ with respect to $x$. This relationship was used in [15] to show that stationary Markov processes 
defined by the Clayton, Gumbel or Student copulas are geometrically ergodic. These notions will be defined 
later on. We will use in this paper the following conventional notation: 
$\|g\|_2^2 = \int_I g^2(x)\,dx$ and $A_{,i}(x_1,x_2) = \partial A(x_1,x_2)/\partial x_i$; 
$c(x,y)$ will be used for the density of $C(x,y)$. 
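
The relationship between the transition probabilities and $C_{,1}$ gives a direct way to simulate such a chain: given the current state $x$, draw $w$ uniformly on $(0,1)$ and solve $C_{,1}(x,y) = w$ for $y$. The following is a minimal sketch for the Clayton copula, for which the inverse is available in closed form. The function names, the seed and the clamping constant are assumptions made for illustration only.

```python
import random


def clayton_conditional(x, y, a):
    """C_{,1}(x, y): derivative of the Clayton copula with respect to x."""
    return x ** (-a - 1.0) * (x ** (-a) + y ** (-a) - 1.0) ** (-1.0 / a - 1.0)


def clayton_conditional_inverse(x, w, a):
    """Solve C_{,1}(x, y) = w for y, with x and w in (0, 1)."""
    return ((w ** (-a / (1.0 + a)) - 1.0) * x ** (-a) + 1.0) ** (-1.0 / a)


def simulate_chain(n, a, seed=0):
    """Simulate n states of a stationary chain with uniform marginals."""
    rng = random.Random(seed)
    eps = 1e-12  # keeps every value strictly inside (0, 1)

    def clamp(t):
        return min(max(t, eps), 1.0 - eps)

    x = clamp(rng.random())  # stationary start: uniform on (0, 1)
    path = [x]
    for _ in range(n - 1):
        w = clamp(rng.random())
        x = clamp(clayton_conditional_inverse(x, w, a))
        path.append(x)
    return path


if __name__ == "__main__":
    print(simulate_chain(5, a=2.0))
```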
1.1.3 Dependence coefficients 

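Many dependence coefficients have been studied in the literature, such as $\alpha_n$, $\beta_n$, $\rho_n$ 
and $\phi_n$, among others. In our paper, we will mainly use the last 3 coefficients, which are defined as 
follows. Given $\sigma$-fields $\mathcal{A}$ and $\mathcal{B}$, 

$$\beta(\mathcal{A},\mathcal{B}) = E \sup_{B \in \mathcal{B}} |P(B \mid \mathcal{A}) - P(B)|,$$

$$\rho(\mathcal{A},\mathcal{B}) = \sup_{f \in L_2(\mathcal{A}),\ g \in L_2(\mathcal{B})} \operatorname{corr}(f,g),$$

$$\phi(\mathcal{A},\mathcal{B}) = \sup_{B \in \mathcal{B},\ A \in \mathcal{A},\ P(A) > 0} |P(B \mid A) - P(B)|.$$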

Given the alternative form of the transition probabilities for a Markov chain generated by a copula 
and a marginal distribution with positive density, it was shown in [7] that these coefficients have the 
following simple form when the copula is absolutely continuous with density $c$, 
$\mathcal{A} = \sigma(X_i, i \le 0)$ and $\mathcal{B} = \sigma(X_i, i \ge n)$: 

$$\beta_n = \int_0^1 \sup_{B \subset I} \left| \int_B (c_n(x,y) - 1)\,dy \right| dx,$$

$$\phi_n = \sup_{B \subset I} \operatorname{ess\,sup}_{x \in I} \left| \int_B (c_n(x,y) - 1)\,dy \right|,$$

$$\rho_n = \sup\left\{ \int_0^1\!\!\int_0^1 c_n(x,y) f(x) g(y)\,dx\,dy : \|g\|_2 = \|f\|_2 = 1,\ E(f) = E(g) = 0 \right\}.$$

Here, $c_n$ is the density of the random vector $(X_0, X_n)$. In general the following inequalities hold 
(see [2, theorems 7.4, 7.5] for more): 

$$\beta_n \le \phi_n, \qquad \rho_n \le 2\sqrt{\phi_n}, \qquad \rho_n \le \rho_1^n. \qquad (1)$$
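
To make the formula for $\rho_n$ concrete, $\rho_1$ can be approximated by discretizing the kernel $c(x,y) - 1$ on a midpoint grid and computing its largest singular value by power iteration; subtracting 1 removes the constant functions, which accounts for the mean-zero constraint. The sketch below is an illustration under stated assumptions: the Farlie-Gumbel-Morgenstern density $1 + \theta(1-2x)(1-2y)$ is used only because its maximal correlation $|\theta|/3$ is known in closed form (it is not one of the copulas studied in this paper), and the grid size and iteration count are arbitrary.

```python
import math


def fgm_density(x, y, theta):
    """Density of the Farlie-Gumbel-Morgenstern copula (illustrative choice)."""
    return 1.0 + theta * (1.0 - 2.0 * x) * (1.0 - 2.0 * y)


def estimate_rho1(density, m=60, iterations=50):
    """Approximate rho_1 as the top singular value of the kernel c(x, y) - 1."""
    grid = [(i + 0.5) / m for i in range(m)]
    # Midpoint-rule discretization of the integral operator with kernel c - 1.
    kernel = [[(density(x, y) - 1.0) / m for y in grid] for x in grid]
    v = list(grid)  # arbitrary non-constant starting vector
    sigma_sq = 0.0
    for _ in range(iterations):
        norm_v = math.sqrt(sum(t * t for t in v))
        if norm_v == 0.0:
            return 0.0  # the kernel vanishes: independence, rho_1 = 0
        v = [t / norm_v for t in v]
        w = [sum(kernel[i][j] * v[j] for j in range(m)) for i in range(m)]  # K v
        u = [sum(kernel[i][j] * w[i] for i in range(m)) for j in range(m)]  # K^T w
        sigma_sq = math.sqrt(sum(t * t for t in u))  # tends to sigma_max^2
        v = u
    return math.sqrt(sigma_sq)


if __name__ == "__main__":
    rho1 = estimate_rho1(lambda x, y: fgm_density(x, y, 0.9))
    # By the last inequality in (1), rho_n is then at most rho1 ** n.
    print(rho1, rho1 ** 5)
```

An estimate strictly below 1 is consistent with exponential $\rho$-mixing through the bound $\rho_n \le \rho_1^n$, but a discretization of this kind is only a numerical indication, not a proof.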

These coefficients are defined to assess the dependence structure of the Markov process, and provide 
necessary conditions for the CLT and functional CLT and their rates of convergence for the statistics 
of the studied model. Some examples can be found in [12] and the references therein. A 
stochastic process is said to be $\alpha$-mixing ($\beta$-mixing or $\rho$-mixing) if $\alpha_n \to 0$ 
($\beta_n \to 0$ or $\rho_n \to 0$, respectively). The process is exponentially mixing if the convergence 
rate is exponential. A stochastic process is said to be geometrically ergodic if, for some $a \in (0,1)$ 
and all $n \in \mathbb{N}$, 

$$\sup_{B} |P(X_n \in B \mid X_0 = x) - P(X_n \in B)| \le a^n W(x), \quad \text{with } E(W(X_0)) < \infty,$$

where the supremum is taken over Borel sets $B$. 

By theorem 2.1 of Nummelin and Tuominen (1982), geometric ergodicity is equivalent to exponential 
$\beta$-mixing. So, whenever we encounter the term, we may understand either of the two concepts. 
The main results on this topic are due to Darsow, Nguyen and Olsen (1992), de la Peña, Ibragimov 
and Sharakhmetov (2006) and Ibragimov (2009), who were among the first to study copula-based 
Markov processes. They characterized copula-based Markov chains. Joe (1997) proposed a class 
of parametric (strictly) stationary Markov models based on parametric copulas and parametric 
invariant distributions. Their setup of the problem was modified by Chen and Fan (2006), who 
studied a class of semi-parametric stationary Markov models based on parametric copulas and 
nonparametric invariant distributions and analyzed the strength of dependence in the Markov 
chain. They showed that the temporal dependence measure is fully determined by the properties 
of the copulas. Studying the strength of the dependence, Beare (2008) provided simple sufficient 
conditions for geometric $\beta$-mixing in terms of copulas without any tail dependence. 
The tail dependent copulas are the most popular copulas because they suitably model shocks 
in economics. In an effort to address this issue, Ibragimov and Lentzas (2008) demonstrated via 
simulation that Clayton copula-based first-order strictly stationary Markov models could behave 
as "long memory" in copula levels (meaning that dependence on the initial distribution tends to 
persist). But Xiaohong Chen, Wei Biao Wu and Yanping Yi (2009) showed that this simulation is 
misleading, because the copula-based models studied by Ibragimov are actually geometric $\beta$-mixing. 
In conclusion, although a time series plot may look highly persistent and "long memory alike," the 
time series may still be weakly dependent and "short memory," so simulation alone is not a reliable way 
to settle the issue. Beare (2010) provided simulation-based evidence that the Clayton copula 
generates exponential $\rho$-mixing. The theoretical proof of this fact and many others was given by 
M. Longla and M. Peligrad (2012) [7]. 

In this paper we derive the copula of the Metropolis-Hastings algorithm for a general chosen 
proposal, and use one of our previous results to provide a condition for exponential $\rho$-mixing (that 
is, a way to choose the proposal to guarantee an exponential rate of convergence). We then make some 
remarks on the rate of convergence of Markov chains generated by convex combinations of copulas. 
These results are applied to the Fréchet and Mardia families of copulas. 

The paper is structured as follows: In section 2 below we provide new results on exponential 
$\rho$-mixing and exponential $\beta$-mixing for some families of Metropolis-Hastings algorithms. In section 
3 we comment on and improve some of our results from [7] on convex combinations of copulas and 
apply them to some families of copulas. 



2 Mixing rates of the Metropolis Hastings Algorithm 



We shall recall that exponential $\rho$-mixing is equivalent to $\rho_1 < 1$ for Markov chains. This fact will 
be used in this section to improve some known results. 

2.1 Introduction to the Metropolis Hastings algorithm 

The M-H algorithm is one of the tools that statisticians use to simulate data from distributions that 
are not completely known, especially when the distributions are known only up to the normalizing constant 
or when it is impossible to obtain i.i.d. samples from them. The investigator is 
therefore looking for a Markov chain that converges to an invariant distribution equivalent to the 
incomplete one at hand. A sample from the stationary phase of this Markov chain is then used 
to study questions related to the initial unknown density. In order to achieve this goal, one should 
know the convergence rates of the Markov chain, which allow approximation of the start of the 
stationary phase of the chain. The mixing rates are also important for inference using statistics 
of the simulated data (they provide corresponding limit theorems for inference and testing). All 
these properties are highly related to the choice of the proposal for the simulation process. We will 
derive here the formula of the copula of these processes for some cases of proposals, which one can 
use along with other theorems in this paper to choose a proposal. 

Let $f$ be the invariant density we want to generate observations from using the M-H algorithm. 
Assume $f$ is strictly positive on its domain with cumulative distribution function $F$. Let $q(x,y)$ 
be the proposal, defined and positive on the domain of $f$ ($q$ is a density in $y$ for each fixed value of $x$). 
Define 

$$\alpha(x,y) = \min\left\{ \frac{f(y)\,q(y,x)}{f(x)\,q(x,y)},\ 1 \right\}.$$

Start the chain at a value in the domain of the distribution of interest, then sample potential new 
states from the proposal distribution $q$, which must be easy to sample from and depends on the 
current state of the chain. More precisely, if the chain is initialized at $X_0 = x_0$, then at any time 
$t \ge 1$ the Metropolis-Hastings algorithm performs the following steps: 

1. Generate an observation $y$ from $q(x, \cdot)$, using the current state $x = X_{t-1}$ as parameter, 

2. Compute the acceptance ratio $\alpha(x,y)$ defined above, 

3. With probability $\alpha(x,y)$, set $X_t = y$, and otherwise set $X_t = X_{t-1}$. 
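
The three steps above can be sketched as follows. This is a minimal illustration, not a tuned sampler: the function and argument names are hypothetical, the target density only needs to be known up to its normalizing constant, and the example at the bottom (a standard normal target with a Gaussian random walk proposal) is an assumption chosen for simplicity.

```python
import math
import random


def metropolis_hastings(f, q_sample, q_density, x0, n, seed=0):
    """Run n Metropolis-Hastings steps; return the path and acceptance rate.

    f         : target density, possibly unnormalized
    q_sample  : q_sample(x, rng) draws a proposal y given the current state x
    q_density : q_density(x, y) evaluates the proposal density q(x, y)
    """
    rng = random.Random(seed)
    x = x0
    path = [x]
    accepted = 0
    for _ in range(n):
        y = q_sample(x, rng)  # step 1: propose from q(x, .)
        ratio = (f(y) * q_density(y, x)) / (f(x) * q_density(x, y))
        alpha = min(ratio, 1.0)  # step 2: acceptance ratio
        if rng.random() <= alpha:  # step 3: accept with probability alpha
            x = y
            accepted += 1
        path.append(x)
    return path, accepted / n


if __name__ == "__main__":
    target = lambda x: math.exp(-0.5 * x * x)  # unnormalized N(0, 1)
    step = lambda x, rng: x + rng.gauss(0.0, 1.0)  # random walk proposal
    sym = lambda x, y: math.exp(-0.5 * (y - x) ** 2)  # symmetric in (x, y)
    path, rate = metropolis_hastings(target, step, sym, 0.0, 10000)
    print(sum(path) / len(path), rate)
```

Constant factors in `f` and `q_density` cancel in the ratio, which is why unnormalized densities are sufficient here.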

Implementation of step 3 requires generating an observation $u$ from the uniform distribution on 
$[0,1]$, then setting $X_t = y$ if $u \le \alpha(x,y)$, otherwise $X_t = X_{t-1}$. It is therefore easy to 
derive the transition kernel of the obtained Markov chain: 

$$P(x,dy) = q(x,y)\,\alpha(x,y)\,dy + \left(1 - \int q(x,y)\,\alpha(x,y)\,dy\right)\delta_x(dy). \qquad (2)$$

We obtain an independent M-H algorithm by choosing an independent proposal (one that does not 
depend on the current state, $q(x,y) = q(y)$). Popular variants of the M-H algorithm are the random walk 
Metropolis (RWM), in which $y = x + \varepsilon$, where $\varepsilon$ is generated from a spherically symmetric 
distribution, e.g., $\varepsilon \sim N(0, \Sigma)$, and the independent Metropolis (IM) defined above. Generally, 
the RWM is used in situations in which we have little idea about the shape of the target distribution and 
therefore we need to "totter" through the sample space. The opposite situation is one in which we have 
a pretty good idea about the target density and we are able to produce a credible approximation $q$ 
that we use as proposal in the IM algorithm. 

The speed and modes of convergence of the M-H chains have been studied by Mengersen and 
Tweedie (1996) [9], Roberts and Rosenthal [14], Jarner [6] and others. Although the general convergence 
properties of the M-H algorithms are theoretically quite well understood, in practical 
implementations the user is left with the difficult task of determining and tuning an appropriate 
proposal distribution. For instance, in the case of a RWM one has to choose carefully the variance 
$\Sigma$ so that a good balance is achieved between the acceptance rate and the chain's autocorrelation 
function. Similarly, in the case of an IM algorithm, one needs to find a distribution $q$ that satisfies 
two non-trivial conditions: 

1) it approximates reasonably well the target distribution and 

2) it can be easily sampled from. 

In practice, the process of determining a good proposal requires a back-and-forth strategy in 
which one starts with an initial proposal and subsequently makes a number of modifications while 
trying to assess their influence on the performance of the algorithm. This "tune-up" requires 
restarting the simulation process a number of times, can be time-consuming and is often frustrating. 
The difficulties are amplified when the target distribution has support in a high dimensional space. 
We propose here a way to avoid such problems, by choosing the "right" proposal to begin with. 
To simplify the formula of $\alpha$, statisticians often use a symmetric proposal. For a general proposal 
$q(x,y)$, the following holds. 

Proposition 1 The copula of the Metropolis-Hastings algorithm with a proposal $q(x,y)$ and an 
invariant density $f$ with cumulative distribution function $F$ is given by 

$$C(u,v) = AC(u,v) + SC(u,v),$$

where 

$$AC(u,v) = \int_0^u\!\!\int_0^v \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))}\,\alpha(F^{-1}(t),F^{-1}(z))\,dz\,dt,$$

$$SC(u,v) = \int_0^{\min(u,v)} \left[ 1 - \int_0^1 \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))}\,\alpha(F^{-1}(t),F^{-1}(z))\,dz \right] dt.$$
Proof. Using the definition of the copula from Sklar's theorem and the relationship with the 
transition probability given in the Introduction, we obtain 

$$C_{,1}(F(x),F(y)) = P(x,(-\infty,y]) = \int_{-\infty}^{y} P(x,dt).$$

Thus, using the chosen proposal and a simple change of variable, we obtain 

$$C_{,1}(u,v) = \int_{-\infty}^{F^{-1}(v)} \alpha(F^{-1}(u),t)\,q(F^{-1}(u),t)\,dt + \left(1 - \int_{-\infty}^{\infty} \alpha(F^{-1}(u),t)\,q(F^{-1}(u),t)\,dt\right) I(u \le v).$$

Of course all this takes into account the fact that the given derivative acts like a distribution 
function (or in other words like Sobolev derivatives). Using simple facts from these theories, simple 
integration, the change of variable inside the integral ($z = F(t)$) and uniqueness of the copula lead 
to: 

$$AC(u,v) = \int_0^u\!\!\int_0^v \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))}\,\alpha(F^{-1}(t),F^{-1}(z))\,dz\,dt. \qquad (3)$$

■ 

$AC$ is called the absolutely continuous part of $C$ and $SC$ is the singular part of $C$. The copula is 
said to be absolutely continuous if $SC(u,v) = 0$ for all $(u,v) \in [0,1]^2$. 

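For a strictly positive symmetric proposal $q(x,y)$, the acceptance ratio simplifies to 
$\alpha(x,y) = \min\{f(y)/f(x),\ 1\}$, and we obtain: 

$$AC(u,v) = \int_0^u\!\!\int_0^v \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))} \min\left\{ \frac{f(F^{-1}(z))}{f(F^{-1}(t))},\ 1 \right\} dz\,dt,$$

$$SC(u,v) = \int_0^{\min(u,v)} \left[ 1 - \int_0^1 \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))} \min\left\{ \frac{f(F^{-1}(z))}{f(F^{-1}(t))},\ 1 \right\} dz \right] dt.$$

In view of all these formulas we have the following: 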

Lemma 2 The copula of the Metropolis-Hastings algorithm with a strictly positive symmetric 
proposal is absolutely continuous if and only if 

$$1 - \int_0^1 \frac{q(F^{-1}(t),F^{-1}(z))}{f(F^{-1}(z))} \min\left\{ \frac{f(F^{-1}(z))}{f(F^{-1}(t))},\ 1 \right\} dz = 0 \quad \text{for almost all } t.$$

2.2 Mixing rates of the Metropolis-Hastings based on the proposal 

As seen before, the choice of the proposal is crucial in the convergence of the MH algorithm. Here, 
we provide some conditions to measure ahead of time the rate of convergence for various classes of 
MH algorithms. The Independence Metropolis-Hastings algorithm is defined by $q(x,y) = q(y)$. 

Theorem 3 The Independence Metropolis-Hastings algorithm generates a geometrically ergodic and 
exponential $\rho$-mixing Markov chain if for almost all $y$, $q(y) \ge \beta f(y)$ for some $\beta > 0$. 

A related theorem was proved by Mengersen and Tweedie (1996) [9]. We provide here a proof of 
this statement based on our previous results from [8]. 
Theorem 4 If there exist nonnegative functions $\varepsilon_1, \varepsilon_2$ defined on $[0,1]$ for which 
the density of the absolutely continuous part of the copula, denoted $c(x,y)$, satisfies the inequality 

$$c(x,y) \ge \varepsilon_1(x) + \varepsilon_2(y)$$

with $\varepsilon_1, \varepsilon_2 \in L_1[0,1]$ such that at least one of the two functions has a non-zero 
integral, then the process is exponential $\rho$-mixing, and thus exponential $\beta$-mixing if the density is 
positive on a set of measure 1. 

Proof. The density of the absolutely continuous part of the copula of the Metropolis-Hastings 
algorithm is 

$$c(u,v) = \frac{q(F^{-1}(u),F^{-1}(v))}{f(F^{-1}(v))}\,\alpha(F^{-1}(u),F^{-1}(v)).$$

For the independence M-H algorithm with the given property, we obtain 

$$c(u,v) = \min\left\{ \frac{q(F^{-1}(u))}{f(F^{-1}(u))},\ \frac{q(F^{-1}(v))}{f(F^{-1}(v))} \right\} \ge \beta.$$

Therefore, by Theorem 4 from [8], the chain is exponential $\rho$-mixing. This chain being also absolutely 
regular (positive density of the absolutely continuous part of the copula on a set of measure 1), we can 
conclude that the chain is geometrically ergodic by Corollary in [7]. ■ 

A similar theorem holds for a non-symmetric proposal $q(x,y)$. 

Theorem 5 If the proposal $q(x,y)$ is such that $q(x,y) \ge \beta f(y)$ for all $y$, then the M-H chain is 
geometrically ergodic and exponential $\rho$-mixing. 

The proof is similar to the above one. 

Remark 6 Mengersen and Tweedie showed in their work [9] that, without this bound on the ratio 
in the case of the independent algorithm, the chain may tend to "stick" in regions of low density. 
This is of considerable practical importance and is not merely a curiosity: seemingly sensible 
procedures give this behavior. They provided an example to illustrate this fact. 
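
The bound $q(y) \ge \beta f(y)$ can be explored numerically before running a sampler by inspecting the ratio $q(y)/f(y)$. The sketch below is an illustration under stated assumptions: the target is a standard normal density, the two proposals are centered normal densities with standard deviations 2 and 0.5, and the helper names are hypothetical. A minimum over a finite grid is only an indication of the infimum over the whole domain.

```python
import math


def normal_density(y, sigma):
    """Density of the centered normal distribution with standard deviation sigma."""
    return math.exp(-0.5 * (y / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def min_ratio_on_grid(q, f, lo=-10.0, hi=10.0, m=2001):
    """Smallest value of q(y) / f(y) over an evenly spaced grid on [lo, hi]."""
    best = float("inf")
    for i in range(m):
        y = lo + (hi - lo) * i / (m - 1)
        best = min(best, q(y) / f(y))
    return best


if __name__ == "__main__":
    target = lambda y: normal_density(y, 1.0)
    heavy = lambda y: normal_density(y, 2.0)  # heavier tails than the target
    light = lambda y: normal_density(y, 0.5)  # lighter tails than the target
    print(min_ratio_on_grid(heavy, target))  # stays bounded away from 0
    print(min_ratio_on_grid(light, target))  # collapses towards 0 in the tails
```

With the heavier-tailed proposal the ratio equals $\tfrac12 e^{3y^2/8} \ge \tfrac12$, so the condition holds with $\beta = 1/2$; with the lighter-tailed proposal the ratio $2e^{-3y^2/2}$ tends to 0, which is the situation in which the chain may stick in the tails.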



3 On mixing rates of mixtures of copula-based Markov chains 

3.1 Convex combinations or mixtures of copulas 

Mixing coefficients for copula-based Markov chains have been widely studied recently. We have 
provided some results based on properties of the copulas in recent papers. These results have some 
implications that we shall stress here. 

Theorem 7 

Let $(C_k(x,y);\ 1 \le k \le n)$ be $n$ copulas. Then, 

1. if $C_1(x,y)$ generates exponential $\rho$-mixing, then any convex combination of these copulas that 
contains $C_1(x,y)$ generates exponential $\rho$-mixing stationary Markov chains. 
2. if one of the copulas in the combination generates an absolutely regular stationary Markov 
chain, and $C_1(x,y)$ generates an exponential $\rho$-mixing stationary Markov chain, then any symmetric 
convex combination containing $C_1(x,y)$ generates geometrically ergodic and exponential $\rho$-mixing 
Markov chains. 

Proof. The proof of the first part of the conclusion is similar to the proof of Theorem 5 of Longla 
et al. (2012) [7], except for the fact that we don't need symmetry of the copulas. Symmetry in 
[7, Theorem 5] was necessary only for the conclusion of geometric ergodicity. The proof of the second 
part needs the reversibility condition on the convex combination, not on all copulas, because what 
matters is that the Markov chain generated by the convex combination is reversible. So, all 
arguments of [7, Theorem 5] work. ■ 

Remark 8 Basically, this theorem states that any convex combination of copulas that contains a 
copula that generates $\rho$-mixing will inherit this property. This happens regardless of the mixing 
properties of the other copulas in the combination. Moreover, if this convex combination is symmetric 
(which can happen even when the copulas themselves are not symmetric), then this combination 
will generate geometrically ergodic Markov chains provided that one of the copulas generates an 
absolutely regular Markov chain. 

This is an implication of this result that can be useful in applications. 
Corollary 9 

Assume $(C_k(x,y);\ 1 \le k \le n)$ are $n$ copulas and $C_1(x,y)$ has the density of its absolutely 
continuous part strictly positive on a set of Lebesgue measure 1. Assume one of them generates a 
$\rho$-mixing stationary Markov chain. Then, any symmetric convex combination $\sum_{k=1}^n a_k C_k(x,y)$ 
with $\sum_{k=1}^n a_k = 1$, $0 \le a_j \le 1$, $a_1 \ne 0$, generates a geometrically ergodic Markov chain. 



3.2 Applications 

1. The Mardia family of copulas 

$$C_\theta(x,y) = \frac{\theta^2(1+\theta)}{2} M(x,y) + (1-\theta^2) P(x,y) + \frac{\theta^2(1-\theta)}{2} W(x,y), \quad \theta \in [-1,1], \qquad (7)$$

where the copulas $W$ and $M$ are the Hoeffding lower and upper bounds respectively and $P$ is 
the independence copula. The formula (7) defines the Mardia family of copulas. This family 
of copulas is a convex combination of the copulas $M$, $W$ and $P$. 

2. The Fréchet family of copulas 

$$C(x,y) = \alpha M(x,y) + (1-\alpha-\beta) P(x,y) + \beta W(x,y), \quad 0 \le \alpha + \beta \le 1. \qquad (8)$$

The formula (8) defines the Fréchet family of copulas. Notice that a Mardia copula is a 
Fréchet copula with $\alpha + \beta = \theta^2$. So, they will share the same properties. Given that the 
copula $P$ defines geometrically ergodic and $\rho$-mixing stationary Markov chains and these 
convex combinations are symmetric, we can conclude by the above Corollary 9 that these 
families generate exponential $\beta$-mixing and geometrically ergodic reversible Markov chains for 
$\alpha + \beta \ne 1$ ($\theta^2 \ne 1$) respectively. 
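
The relation between the two families can be checked directly. The following is a minimal sketch, with illustrative function names, that builds both families from $M$, $P$ and $W$ and recovers a Mardia copula as the Fréchet copula with weights $\alpha = \theta^2(1+\theta)/2$ and $\beta = \theta^2(1-\theta)/2$, so that $\alpha + \beta = \theta^2$ and the weight of $P$ is $1 - \theta^2$.

```python
def M(x, y):
    """Hoeffding upper bound."""
    return min(x, y)


def W(x, y):
    """Hoeffding lower bound."""
    return max(x + y - 1.0, 0.0)


def P(x, y):
    """Independence copula."""
    return x * y


def frechet(x, y, alpha, beta):
    """Frechet copula: alpha*M + (1 - alpha - beta)*P + beta*W."""
    return alpha * M(x, y) + (1.0 - alpha - beta) * P(x, y) + beta * W(x, y)


def mardia_weights(theta):
    """Weights of M and W in the Mardia copula with parameter theta."""
    alpha = theta ** 2 * (1.0 + theta) / 2.0
    beta = theta ** 2 * (1.0 - theta) / 2.0
    return alpha, beta


def mardia(x, y, theta):
    """Mardia copula, written as a Frechet copula."""
    alpha, beta = mardia_weights(theta)
    return frechet(x, y, alpha, beta)


if __name__ == "__main__":
    alpha, beta = mardia_weights(0.6)
    print(alpha, beta, 1.0 - alpha - beta)  # the weight of P is 1 - theta^2
    print(mardia(0.3, 0.8, 0.6))
```

The weight of the independence copula $P$ is positive exactly when $\theta^2 \ne 1$ (respectively $\alpha + \beta \ne 1$), which is the case covered by the conclusion above.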

Acknowledgment 10 The Author thanks his advisor M. Peligrad for useful comments and all the 
help she provides in everything. 